Visualizing Inference
نویسندگان
چکیده
Graphical visualization has demonstrated enormous power in helping people to understand complexity in many branches of science. But, curiously, AI has been slow to pick up on the power of visualization. Alar is a visualization system intended to help people understand and control symbolic inference. Alar presents dynamically controllable node-and-arc graphs of concepts, and of assertions both supplied to the system and inferred. Alar is useful in quality assurance of knowledge bases (finding false, vague, or misleading statements; or missing assertions). It is also useful in tuning parameters of inference, especially how “liberal vs. conservative” the inference is (trading off the desire to maximize the power of inference versus the risk of making incorrect inferences). We present a typical scenario of using Alar to debug a knowledge base. Visualizing concepts and assertions in AI We present Alar, a visualization system for a large commonsense ontology and knowledge base, ConceptNet, and its associated heuristic inference technique, AnalogySpace [Speer et al 08]. Alar can visualize both graphs of concepts, and also graphs of assertions. Alar is based on display of dynamic node-and-arc graphs, dynamically adjusting using the force-directed layout of the visualization toolkit D3JS [Bostock 14]. In the Concept view, nodes represent Concepts (like elements of an ontology), and links represent similarity between concepts. Link thickness represents the degree of similarity, and lines exert a proportional spring-like “force” in the dynamic graph pulling its nodes closer (working against a repelling force that spaces out the nodes). Concepts with similar meanings will be seen to cluster together. Words with, say, more than one meaning, will find themselves pulled between clusters that represent each of their meaning contexts. ____________________________ Copyright © 2015, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved. In the Assertion view, nodes represent assertions, a triple of Concept-Relation-Concept (e.g. “Fork UsedFor Eating”). Links represent similarity of assertions (not necessarily that one assertion logically implies another, although for a link between an assertion in the knowledge base (black) and an inferred assertion (green), it is usually the case that it exerts a strong influence. The size of the dot indicates its truth value. We believe the visualization of assertion graphs to be particularly novel. Visualization of related sets of assertions can be a powerful tool in debugging knowledge bases and inference. The assumption is that the inference space has a kind of smoothness characteristic – inference about similar concepts should have similar truth values. When incorrect assertions are inferred, they are often connected to incorrect assertions in the knowledge base, so can be easily spotted. Discontinuities in the assertion space can also be a clue that some important knowledge is missing from the knowledge base. Finally, incorrect assertions can also appear when inference is “too liberal” – it concluded something without sufficient evidence. Figure 1. An Alar visualization, centered on the assertion “Orange is a food”. Inferred assertions (green) use related knowledge about food to infer new assertions, e.g. “Orange AtLocation grocery store”. Similarity of concepts and assertions are computed using the analogical heuristic reasoning technique AnalogySpace [Speer et al 08]. It works by making a matrix of concepts vs. “features” (relation + concept), and taking the principal components of this matrix. Such components often represent important semantic distinctions, such as “good vs. bad”. A previous visualization for ConceptNet displayed these components as axes in a multidimensional space [Speer et al 10]. Alar’s visualization technique should be applicable to other systems based on concepts and inferred assertions, and that have a “liberal vs. conservative” control parameter. Debugging inference with Alar There are three interactive controls over the visualization, shown in Figure 2. First, dimensionality, which controls how “liberal” or “conservative” the inference is. For concepts, liberal inference results in more similarity links; for assertions, more inferences. Spacing supplies “negative gravity” making semantic clusters more readable. The link strength is a movable slider on a histogram of number of links vs. strength. Only links to the right of the slider are displayed, giving control over the level of detail. The interface is seeded with one or more initial concepts (e.g. “Orange”) or assertions (“Orange is a food”). An operation, “Add Related Nodes” finds the most similar concepts (or assertions) to the seeds and expands the graph. Figure 3 shows a typical situation where there is a bug in the knowledge. The incorrect assertions, “Cup IsA Drink” and “Bottle IsA Drink” were inferred from the incorrect KB assertion, “Glass IsA Drink” (probably a failure of our parser on something like, “A glass of water is a drink”). Assertions around it like “Wine IsA Drink”, “Beer...” etc. are unaffected.
منابع مشابه
Visualizing high-dimensional posterior distributions in Bayesian modeling
In Bayesian modeling inference is based on the posterior distribution of the model parameters. The closed-form solution is seldom known and samples of the posterior have to be computed with Markov Chain Monte Carlo (MCMC) methods. The problem is that for large models the samples are highdimensional, and it is hard to piece together properties of the posterior. Our proposal is to use a non-linea...
متن کاملT-REX: a web server for inferring, validating and visualizing phylogenetic trees and networks
T-REX (Tree and reticulogram REConstruction) is a web server dedicated to the reconstruction of phylogenetic trees, reticulation networks and to the inference of horizontal gene transfer (HGT) events. T-REX includes several popular bioinformatics applications such as MUSCLE, MAFFT, Neighbor Joining, NINJA, BioNJ, PhyML, RAxML, random phylogenetic tree generator and some well-known sequence-to-d...
متن کاملctree: Conditional Inference Trees
This vignette describes the new reimplementation of conditional inference trees (CTree) in the R package partykit. CTree is a non-parametric class of regression trees embedding tree-structured regression models into a well defined theory of conditional inference procedures. It is applicable to all kinds of regression problems, including nominal, ordinal, numeric, censored as well as multivariat...
متن کاملVisualizing Scientific Inference
The sciences use a wide range of visual devices, practices, and imaging technologies. This diversity points to an important repertoire of visual methods that scientists use to adapt representations to meet the varied demands that their work places on cognitive processes. This paper identifies key features of the use of visualization in a range of scientific domains and considers the implication...
متن کاملThe Bayes Inference Engine
We are developing a computer application, called the Bayes Inference Engine, to provide the means to make inferences about models of physical reality within a Bayesian framework. The construction of complex nonlinear models is achieved by a fully object-oriented design. The models are represented by a data-flow diagram that may be manipulated by the analyst through a graphicalprogramming enviro...
متن کاملReporting an Experience: Improving the Feulgen Staining Technique for Better Visualizing of Nucleus
Among different staining methods used to demonstrate the nuclear abnormalities, Feulgen is one of the most reliable method. Feulgen staining is specific, sensitive method for evaluating the DNA damages.It has been shown that using non-DNA specific stains for monitoring the nuclear anomalies lead to false-positive or false-negative results. From self-experience, immersing the stained slides in h...
متن کامل